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Abstract 

We define a new notion of compressibility of a set of numbers through the dy- 
namics of a polynomial function. We provide approaches to solve the problem 
by reducing it to the multi-criteria traveling salesman problem through a se- 
ries of transformations. We then establish computational complexity results 
by giving some NP-completeness proofs. We also discuss about a notion of e 
K-compressibility of a set, with regard to lossy compression and deduce the 
necessary condition for the given set to be e K-compressible. Finally, we con- 
clude by providing a list of open problems solutions to which could extend the 
applicability the our technique. 
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1. Introduction 

To the best of our knowledge, all the compression techniques existing in the 
literature are static in nature. By static, we refer to the methods which look for 
repetitive patterns in the input data and perform compression by encoding the 
data as codewords. The encoding process typically involves assigning shorter 
codewords to the most repeated patterns and longer codewords for the seldom 
ones. A look-up table stores these pattern-codeword matchings and is trans- 
mitted along with the encoded data. Decoding process comprises of searching 
the look-up table and retrieving the pattern corresponding to the received code- 
word. The number and the length of the codewords are determined by the error 
correcting capability of the associated technique. The most widely used static 
compression technique is the Huffman coding-an entropy encoding algorithm 
used for lossless data compression Number theoretic methods like arith- 
metic encoding Q and range encoding Q schemes have also gained popularity. 
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These entropy based techniques are known to compress messages close to the 
theoretical limit computed via entropy. Other lossless compression techniques 
like the Lempel-Ziv- Welch (LZW) compression [H, H| and its many variants are 
also widely known in literature. They build the dictionary (codewords) by scan- 
ning through the input string for successively longer substrings until they find 
one that is not in the dictionary. 

In many applications, one may encounter scenarios where one would like to 
compress a set as a whole, where the chronological order in which the elements 
appear in the set are irrelevant. It is acceptable as long as the elements con- 
stituting the set are retrieved in some order regardless of the order in which 
they originally featured in the set. Such cases occur quite often especially when 
transmitting large databases over the internet. If we associate a number to 
each tuple representing (say) an employee record, the order in which these tu- 
ples (numbers) are received at the recipient end may not have any significance. 
Even if the positional information is needed, bits indicating the position number 
can be padded to the tuple and then sent. Another case where the positional 
information of data is of very little relevance is in building a cumulative dis- 
tribution from empirical data. These scenarios only demand that the data is 
transmitted as a set and not necessarily as a sequence. 

The existing techniques in the literature seems to be blinded to this distinc- 
tion between the compression a set and that of a sequence. All the aforemen- 
tioned static compression technqiues could very well be used to compressing a 
set, as these methods only focusses on the frequency of the repetitive patterns 
in the input data and hence are invariant to the order in which these patterns 
appear. Unfortunately, the strength of these techniques is also its weakness. 
Namely, their compression efficiency is also invariant to an arbitrary permuta- 
tion of the set and hence remains the same for both a set and for a sequence 
obtained by imposing an order on the set say by a permutation of its elements. 
Other lossy dictionary based techniques like the overcomplete DCT, KSVD 
which tries to compress a given collection of vectors by representing them by a 
corresponding sparse vector built using the dictionary, also relies on an ordering 
imposed on the set, i.e, one need to explicity identify the first component of the 
vector, its second component etc and the compression efficiency will depend on 
the order in which one identifies these components. 

In this work, we introduce a new approach to the compression of a set of 
numbers through the dynamics of a polynomial function. The fundamental 
question that we try to answer is: "What is the computational complexity of 
determining the permutation of the set for which it is maximally compressible?". 
The maximal compression of a set is connected with the degree of the polynomial 
function that is used for compression, a notion that will become clearer as we 
proceed. Before we delve into the intricacies of our technique, let us consider a 
very simple example. Let us assume that we wish to compactly represent the 
set S consisting of first 100 natural numbers, namely S — {1,2,3,..., 99, 100}. 
Static compression techniques may first encode this set in binary and then look 
for patterns in the long sequence of binary digits. Codewords are then assigned 
to each of the patterns to obtain the encoded data. But given this set S, we 
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immediately notice a global structure which these static compression techniques 
are oblivious of, namely the elements of the set S can be produced in succession 
by adding 1 to the previous element. In the terminology of dynamical system, 
this set S can be produced starting from 1 and by repeated composition of the 
function f(x) = x + 1 with itself. Each composition of the function produces 
one element of the set. Hence the most succinct representation of this set is the 
starting element 1 and the coefficients of the iterating function /. 

The set defined above is naive and the structure it exhibits is simple and 
conspicuous. Instead consider the following set of 10 numbers, 

S = {0.0016, 0.3016, 0.0990, 0.9178, 0.15, 0.51, 0.9996, 0.3567, 0.0064, 0.0254} 

which appears to be totally random. But when rearranged, the elements of the 
set form the orbit of the first 10 iterations of the quadratic map f(x) = 4x(l — x) 
starting from 0.15. Hence the arbitrary looking set S can actually be very 
compactly represented by encoding only the four numbers-the triplet (—4, 4, 0) 
corresponding to the coefficients of the quadratic map and the starting element 
0.15. Numerous other examples can be constructed where a random looking set 
can actually be produced by iterating a smaller degree polynomial and hence 
can be efficiently represented. This paper provides ways to test whether a given 
set imports such structures. 

Let us now begin with the formal definition of when we consider a set to be 
intrinsically compressible. 

Definition A set S consisting of N distinct numbers is K- compressible if there 
is exist a polynomial / of degree utmost K and a number x S S, such that 
S={x J(x ),...J N - 1 (xo)}. 

Here /' denotes the composition of / with itself i times. We would like to 
emphasize that the above definition is based on equivalence of sets and not 
on sequence, i.e., the order in which the elements of S occur in the orbit of / 
starting at xq is irrelevant. The immediate question that follows is: "How to 
determine whether a given set is K-compressible?" 

If the sequence information is known, i.e, the sequence in which the elements 
of S occur in the orbit of / are known a-priori, then it is a fairly easy task to 
find out whether the given set is K-compressible. Without loss of generality, 
let x\, X2, ■ ■ ■ , xm be the known sequence. By the unisolvence theorem, there 
exist a unique polynomial / of degree utmost K with the property that f(xi) — 
Xi+i, 1 < i < K + 1. The coefficients of / can easily be computed by inverting 
the corresponding Vandcrmonde matrix (many faster algorithms like Newton 
and Lagrange interpolations do not require to invert the Vandcrmonde matrix). 
Once the coefficients of / are determined, it is a trivial task to check whether 
f(xj) = x j+1 ,K + 2<j<N-l. 

The problem is interesting only the sequence information is unknown and 
only when K < N — 2. When K > N ~ 2, by the above argument we see that 
for each of the possible N\ sequences, there exist a unique / of degree utmost 
N — 2 which produces the numbers in that sequence. Hence the set is trivially 
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K-compressible. When K < N — 2, is it the case that all the ( K N +2 )(K + 2)1 
possible sequences need to considered and verified separately before reaching a 
conclusion? The remaining sections of the paper answers this one question. 

The paper is organized as follows. Section ^ states and proves a prop- 
erty of K-compressible sets. In section ^ we give approaches to check for 
K-compressibility by reducing it to other well known problems through a series 
of equivalent transformations. In section Q we establish computational com- 
plexity results and in section (J5J) we discuss the notion of e K-compressibility 
with regard to lossy compression and deduce the necessary condition for the 
given set to be e K-compressible. We conclude in section © by providing a list 
of open problems, some of which arc interesting from a theoretical perspective. 

2. Property of K-compressible sets 

The following is a useful property of K-compressible sets. 

Lemma 2.1. Every affine transformation of a K-compressible set is K-compressible. 

Proof. Let S — {xi,X2, ■ ■ ■ ,xn} be a K-compressible set. Let the K th degree 
polynomial f(x) — ^2k=o a k xk be the compressing function. Without loss of 
generality let f(xi) = x i+ i, 1 < i < N. Let = Xxi + 7, 1 < i < N, A, 7 G E. 
Denote S = {yi,y%, ■ ■ ■ , Vn}- We now show the existence of g G J\ K {Y\ K de- 
notes the vector spaces of polynomials of degree at most K) with the property 
that g(jji) = Vi+i and hence prove that S is K-compressible. In what follows, 

let 9(y)=T,k=o b kV k - 

case(i): Let 7 = 0. The only interesting case is A ^ 0. Let bk = x t-i ■ Then 

K 

9(yj) = E Xakxk j = Xx 3+ i = y?+i> 1 < j < N 

k=0 



case (ii): Let 7 = A = 1. Then for 1 < j < N - 1, 

K K k 

g{y 3 ) = E^ +1 ) fe = EEHzH 

= x>j(x>0V 

k=0 \l=k v 7 / 

Define = Y]^- k bi (^.) , 1 < k < K , ao + 1 = ^ an( l so ^ ve the linear system 

of equations to obtain bo, b\, . . . , bx as a function of ao, a±, . . . , ax- Hence, 

K 

9(yj) = E akxk j + 1 = x i +1 + 1 = y j +1 - 

k=0 
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case (iii): Let A = 1,7 ^ 0. Let Zj — Then yj = -f(zj + 1). By case(i), 
3fi G YIk w ith fi(zj) = Zj+i. Replacing Xj with Zj in case(ii), 3/2 G J} ^- with 
fi{zi + 1) = z j+i + L Replacing Xj with Zj + 1 in case (i), 3g G with 

9{l{zj + 1) = = l(zj+i + 1) = J/j+i- 



case (iv): For the general case, let zj — Xxj. Then yj = Zj + 7. By case 
(i),3/i G with fi(zj) — Zj+i- Replacing Xj with zj in case (iii), 3g G 
with 

+ 7 = j/j) = Zj+i + 7 = 
which completes the proof. □ 



3. Equivalent transformations 

We now provide a set of equivalent transformations of our problem which 
arc useful in establishing computational complexity results and in providing 
approaches to solve the problem. The K-compressibility problem can be formally 
restated as follows. Given a set S of N distinct numbers S — {x\, X2, . . . ,Xn} 
and an integer K < N — 2, does 3a G Sn (Sn denotes the symmetric group) and 
a / G Y\ K such that f{x a ^)) — 1 < i < N — 1. We would like to stress 

on this key point that if a set S is K-compressible, then elements of S should be 
producible starting at some point x G S and by repeated composition of K th 
polynomial /. In other words, / 4 (xo) £ S, < i < N — 1 and f l (xo) ^ f->(xo) if 

Immediately we observe similarities between the constraints imposed above 
and the constraints imposed in traveling salesman problem and the Hamiltonian 
path problem. If we regard each number as a city (or vertex) , the problem then is 
to find the sequence of cities (numbers) such that each city is visited exactly once 
and a certain other condition is met. We call this constraint the Hamiltonian 
path constraint. In what follows, we show how these disparate looking problems 
having similar conditions can be tied together. 

From the given set S, choose a y G 5* and assume that 3xq G S and / G YIk 
such that f N ~ 1 (xo) = y and satisfying the Hamiltonian path constraint. The 
problem then reduces to determining whether for at least one choice (out of pos- 
sible N choices) of y, our assumption holds good. Without loss of generality let 
x n = V- We then need to determine whether there exist numbers ao, a\, . . . , ax 
such that the following matrix equation 



Xi 
X2 



1 X N -i 



x\ 

~2 



„K 



l N-l. 





ao 








ai 




V2 




_a K _ 




_V N -1_ 
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is satisfied along with the Hamiltonian path constraint. The matrix equation 
can be succinctly represented as Va — v. Here v\, v-i, ■ ■ ■ , ujv-i € S. Note that 
the missing element vn is the starting element xq. For K < N — 2, the above 
set of equations form an over-determined system. The matrix V on the left of 
the equation is commonly referred to as Vandermonde matrix and is well known 
in polynomial interpolation theory. 

Using the least squares solution for a, namely a = (V T V)~ 1 V T v, v satisfies 

V(V T V)- 1 V T v = v, 

i.e, v is the eigen-vector of V(V T V)~ 1 V T with eigen-value 1. 

\Ri 



Let V = [Q1Q2] q denote the QR decomposition of V, where Q = [Q1Q2] 

is an orthogonal matrix and Ri is K+lx K + 1 upper-triangular matrix. Notice 
that V = Q1R1 and Q^Qi = I. Since rank of V is K + 1, it follows that R\ 
is full-ranked and hence invertible. Using elementary matrix algebra it can be 
shown that V(V T V)~ 1 V T = QiQj and hence v satisfies QiQjv — v, which 
can be rewritten as 

{QiQl - i)v = 0. (1) 

Let A = Q1Q1 — I. Append v N to v and add a last column of all O's to A. 
Recall that each vi € S and we need to determine whether a solution exists for 
the matrix equation Av = satisfying the Hamiltonian path constraint. 

3.1. Transformation to Exact weight perfect matching problem 

We now provide equivalences between solving the matrix equation and the 
exact weight perfect matching problem. Let us currently ignore the Hamiltonian 
path constraint and also the nature of coefficients of the matrix A and consider 
solving a more general system Av = where A is some known MxN matrix and 
each Vi G S for some known set S = {xi,X2, ■ ■ ■ , Xn}. Define a vector x € Q N 
by x = \x\,X2, ■ ■ ■ ,xn] t . Checking whether solution exist for the general system 
Av = is equivalent to verifying whether there exist a permutation matrix P 
for which APx = 0. Though this problem may have already be shown to be 
NP-complete, for the sake of completion we provide a proof here by a reduction 
from set-partition. 

Theorem 3.1. Given a M x N matrix A and a vector x G Q N , determining 
whether a permutation matrix P exist for which APx = is NP-complete. 

Proof. This problem can easily be checked to be in NP, as given the permuta- 
tion matrix P, verifying whether APx = can be done in polynomial time. 

For the NP-hardness proof, consider a special case of the set-partition prob- 
lem namely, "Given a set S of N numbers and an integer T , does there exist a 
partition of S into two subsets G and H of cardinality T and N — T respectively, 
such that the sum of elements in G equals the sum of the elements in H?" Even 
with this additional restriction on the size of the subset, the set-partition prob- 
lem is NP-complete as one can add N zeros to a given set-partition problem 
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to get a set of 2N elements and check whether this set can be partitioned into 
two subsets of cardinality N each such that the sum of the elements in one 
partition equals the sum in the other. We now give a easy reduction from this 
set-partition problem. 

Define a matrix of size 1 x N where the first T are 1 and the remaining N ~T 
entries are —1. As above define x = [xi, X2, ■ ■ ■ , xn] T where x%, X2, ■ ■ ■ , xn are el- 
ements of the given set S. Now if G = {<?i, #2, • • • , 5t} and H = {hi, hi, • • ■ , hpf-r} 
solves the set-partition problem, then consider the permutation P for which 
Px = [gfi, g2, ■ ■ ■ , <?t> h\, hi, . . . , /ijv_t] t and hence by construction of A, we 
see that APx — 0. The other direction can be proved by just reversing the 
arguments. □ 

We now reduce the above problem to the exact weight perfect matching 
problem by defining a weighted complete bipartite graph for each of the M 
constraints and then later combine these graphs into an equivalently weighted 
single complete bipartite graph. Imposing the Hamiltonian constraint amounts 
to enforcing restrictions on the perfect matching which will be done in the end. 

Consider a restatement of the above problem. Given a M x N matrix A 
and x £ Q N , we need to find whether 3n 6 Sn such that 53j=i a i,i x n(j) = 
0, 1 < i < M . Here ay denotes the entry of the matrix A. Let m.j = 

max{0, —an, —an, ■ ■ ■ , — ai,Ar} and n = max{0, —Xi, —x 2 , ■ ■ ■ , —2;^}. Let Sj = 
YljLi a i,j an d s = Ylf=i x j- Let bij = a.ij + rrii and yj = xj + n. Define 
on = nsi + rriiS + Nnrrii. By definition we have bij, yj > 0. The problem is then 
equivalent to finding a permutation ir £ Sn such that YljLi ^i,n(j)Vn(j) = Q 'i 
for 1 < i < M. With the above set up in place, each constraint can be turned 
into an exact weight perfect matching problem in a complete bipartite graph as 
follows. 

Define a complete bipartite graph G = (V,E), V = B U Y where \B\ = 
\Y\ = N. Index the vertices of B and Y as 61, bi, ■ ■ ■ , &jv and yi, y~2, ■ ■ ■ , yk 
respectively. Let e u>v € E denote the edge connecting the vertices b u and y v . 
For each one of the above M constraints, define a cost Ci{e U}V ) for the edge 
e u . v as Ci{e UyV ) = bi jU y v , 1 < i < M. The above problem can be reformulated 
as "Given this graph G with costs defined as above, does there exist a perfect 
matching PM C B x Y such that the cost of the PM , (Ci(PM)) equals on for 
alll<i< M?" 

We now transform these M exact weight perfect matching problem into 
one problem by following the lines of @, where the authors reduce the min- 
max assignment problem Q to exact weight perfect matching problem Let 
C m ax = maxi tU)V Ci(e UtV ). Consider a complete bipartite graph G with the same 
vertex set as graph G. Define the cost w(e UjV ) for the edge e u>v in G as 

M 

w(e u ,v) = X! bi,uy v {NC m ax + = h u y v , (2) 
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where 

M 

K = J2b i , a {NC max + l) i - 1 . (3) 

8=1 

Define 

M 

a = J2 a i(NC max + l) i -\ (4) 
i=i 

We now show the following. 

Theorem 3.2. G has a perfect matching PM with Ct(PM) = cti, 1 < i < M 
if and only if G has a perfect matching PM with C(PM) = a. 

Proof. For both directions of the proof, let ei j7r m, e2 iTr (2), • ■ • , £n,tt(n) for some 
7r 6 Sm constitute the edges of the perfect matching. 

case(i): Let G have a perfect matching PM with Ci(PM) — oti. We have 
Y^j=i c i( e ],n(j)) = a i> 1 < * < M - Then, 

JV N M M 

^ro(e J>(i) ) = ^^Ci(e J >(j-))( JVC '™» + = $3ai(^C maa! + l) 4 " 1 = a 

j — l j — 1 i=l i=l 



case(ii): Let G have a perfect matching PM with C(PM) = a, i.e X^i w ( e j,ir{j)) 
a. It then follows, 

iV Af iV 



J=l i=l i=l 



N 



Recall the definition of a from equation ((3]). Also notice the inequality 

N 

^2 c i( e j,ir(j)) - N C max ,\li, 
3=1 

as C m ax > Ci(e u .t>) and Ci(e u ^ v ) > by definition. By representing numbers in 

base {NC max + 1), it follows that X)j=i c i( e j,7r(j')) = a * = Gi(PM),V« which 
completes the proof. □ 

We would like to emphasize that G is polynomial time reducible from G and 
a can be represented in 0(M log C max ) bits. Thus the following result can be 
deduced from the above reduction and theorem 13. 1[ namely 



Corollary 3.3. Exact weight perfect matching is NP-complete 
which is a known result 
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3.2. Transformation to Multi- Criteria Traveling salesman problem 

Barring the Hamiltonian path constraint, we have shown how the K-compressibility 
problem can be reduced in polynomial time to the exact weight perfect match- 
ing problem by a series of transformations from the equation ([1} , as illustrated 
in section (|3.1[) . Enforcing the Hamiltonian path constraint reduces it to the 
following permutation problem. 

Define a set Z consisting of N tuples Z = < ( 1 | , ( 2 | ..... ( ^ N | 1 where 

\\vij \V2) \yNjj 

h^s are defined according to equation ([3]) and j/j = Xi+n as defined above. Recall 
that a; at is chosen to be the element for which we have assumed the existence 
of xo G S and / G J\ K such that f N ~ 1 (xo) — xn with the property that 
f l (xo) G S, < i < N — 1. In other words xn is assumed to be the last element 
of S in the orbit of / starting from xq. 

Verifying whether the given set S is K-compressible with the xn being the 
last element is equivalent to checking the existence of a permutation it G Sn 
such that 



u=l 

where a is defined as per equation^ and ir(N + 1) refers to 7r(l). This problem 
of finding the permutation tt can be transformed to a multi-criteria traveling 
salesman problem d, SE^' as illustrated below. 

Define a weighted complete directed graph G' — (V , E') consisting of N 
vertices with no self- loops. Let us label the vertices in G' as x%, x 2 , ■ . ■ , xm 
corresponding to the elements of the given set S for which we need to check 
whether it is K-compressible. Let e' u v denote the directed edge from vertex 
x u to x v . Define the weight on edge e' u v to be w(e' u v ) = h u y v . The problem 
then reduces to determining whether there exist a tour in G' such that each 
vertex is visited exactly once and the cost of the tour being exactly equal to 
a. Demeko and Woeginger Q have shown polynomial reduction of the exact 
weight perfect matching problem to the min-max assignment problem. The 
same reduction technique can be used to transform the exact cost tour problem 
to a multi-criteria traveling salesman problem [Toj], solutions to which can then 
be availed to get fast solutions to our problem. To decide whether the given set 
S is K-compressible, we just need to repeat this process over all the N possible 
choices for xjy. 

4. Computational complexity of K-compressibility 

The previous section provided approaches to solve the K-compressib ility 
problem by reducing it to the multi-criteria traveling salesman problem [id ]. 




But the above reduction doesn't throw any light on the inherent complexity of 
our problem. The problem is indeed in NP as the certificate can be the actual 
sequence in which the elements of S occurs in the orbit. Once the sequence is 
known, verifying whether a K th polynomial / can produce elements of S in that 



N 




(5) 
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sequence can easily be done in polynomial time as discussed under sectionfll]). 
But the following question still remains unanswered, "Is K- compressibility NP- 
hard?" 

Reconsider the above set of equivalent transformations discussed under sec- 
tion ([3]) where we reduced our problem to the multi-criteria traveling salesman 
problem. The multi-criteria traveling salesman problem is a well known NP- 
complete problem for an arbitrary set of weights on the edges [lpj . But if the 
edge weights are defined according to the transformations described in section 
([3])-which are some complicated functions (involving QR decomposition of the 
Vandermonde matrix) of the set elements x±, X2, ■ • • , XN~is the resultant travel- 
ing salesman problem still NP-complete? Currently, we do not have an affirma- 
tive answer to this question. But we now show that even for the simplest case 
where the edge weight w(e' u v ) for the edge e' u ^ v G E' connecting the vertices 
x u and x v in the resultant graph G'-defined in sections-is w(e' uv ) — y u Vv 
where we have replaced h u (defined in equation ([3])) with y u , the multi-criteria 
traveling salesman problem or equivalently exact cost tour traveling salesman 
problem is NP-complete. 

With the weights defined as above, namely w(e' uv ) = y u y v , the exact cost 
tour traveling salesman problem is equivalent to determining whether 3n G Sn 
such that YliLi Vir(i)yn(i+i) = a f° r some fixed cost a with the understanding 
that ir(N + 1) = 7r(l). Wc now show the problem to be NP-complete by a 
reduction from bounded-knapsack problem. 

Theorem 4.1. Given a set S = yi,yi, ■ ■ ■ ,Vn of N elements and a cost a, 
determining whether 3n G Sn such that Yli=i yTr(i)V-K(i+i) = a * s NP-complete. 

Proof. The problem can easily be seen to be in NP, as once the permutation 
7r is known, verification can be done in polynomial time. For the NP-hardness 
proof we consider reduction from bounded-knapsack problem. 

Bounded-knapsack problem is defined as follows. "Given a set of items, each 
with utmost bi copies, and a weight and profit for each of it, determine the num- 
ber of each item to include in a collection so that the total weight is less than 
a given weight limit and the total profit exceeds the given minimum profit?'' 
Let W = W2, ■ ■ . , Wn} and P = {p\,p2, ■ ■ ■ ,Pn} denote the weights and 
profits respectively. Let aw and ctp be the weight limit and profit value respec- 
tively. The bounded-knapsack problem is to determine whether the following 
inequalities 

N N 

^Piii > a p, X! Wii1 - aw 

i=l i=l 

can be satisfied with 7* G {0, 1, . . . , bi}. 

Let us consider an special case of the bounded-knapsack problem where 
Pi = Wi = Vi and bi = 2, Vi Having the weights and profits to be equal, reduces 
it to the subset-sum problem with two copies for each item. The problem is 
then meaningful only when ctp — aw — a. It reduces to checking whether 
G {0,1,2} such that J2iLiVili = a - We now show reduction from this 
NP-complete problem. 
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Without loss of generality assume that yi,a G Z + and t/j > 1 . Dchnc 
Zi = ajji. The problem is equivalent to verifying whether Si=Li z ili = a2 with 
7i € {0, 1, 2}. Let S" = {21, 22, • • ■ , z N }. Let p = 2N + 1. Append the set S' 

with - iV + 1 times and 2N + 2 0's and call the appended set S". This set S" 

2 

and the cost ^- will be the input to our problem. 

case(i): Assume that the bounded-knapsack problem has a solution ,i.e, 3^i S 
{0,1,2} such that SiLi z ili = a2 - Let T , Ti and T 2 form a partition of 5' such 
that Zi G T n if and only if 7$ = in the solution. Let 7V ra denote the cardinality 
of the subset T n whose elements are denoted by T n — {in,i,in,2, • • • ,tn,N„}- 
Consider the permutation 7r which places the elements of S" in the order 

^0,1)0, t 2 , 0, . . . , to. N , 0, t\ 1, - ,t\ 2,0, 3, -,tl 4, . . . ,0, ti ,N t -l, - ,*l,iVi)0, 

1 1 1 1 1 1 

-)t2,l, -,^2,2, ~,t2,3, ■ • ■ , ~,t2,N 2 , - , 0, - , 0, . . . , 
p P P P P P 



i.e, every to,i G To is sandwiched between two 0's, each t\j G Tj is either suc- 
ceeded or preceded by i and every ^2,i G Tg is sandwiched between two ~. By 
construction, this permutation 7r is the solution to our problem. 



case(ii): Let the permutation 7r be the solution to our problem. Since Zj > a, 
no two elements of S" can be juxtaposed in the ordering induced by 7r. Let 

if Zi is sandwiched between two 0's in the order; 
if Zi is either preceded or succeded by ^; 
if Zi is sandwiched between two | in the order. 



We have, 



Sill giT? + H = — 

p p 2 p 



where < (3 < 2N and is the sum obtained from juxtaposing i . Rearranging 
terms we get /3 = (a 2 — SiLi z ili)P- Since a, G Z and p = 2A^ + 1, /? should 
equal 0. Thus S»=i z »7» = ft2 which completes the proof. □ 

From the above theorem it easily follows that, for the general case where 
h u is arbitrary-instead of equation ©-the exact cost tour traveling salesman 
problem with weights defined as above is NP-complete. This provides a strong 
evidence that the K-compressibility problem may not have a polynomial time 
solution unless P=NP. 



4.1. Difficulty in NF '-completeness proof 

An answer to the following question might throw some light on the difficulty 
of showing the K-compressibility problem to be NP-hard. "For every K > 
2, does 3 an integer M(K), such that \/N > M(K), any K- compressible set 
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S consisting of N distinct rationals is uniquely K- compressible, i.e there exist 
utmost one compressing function f € Y\ K , f £ Yii I n other words, is it true 
that, for a given K > 2, for a sufficiently large N, any set S — {xi, ■ ■ ■ ,x N } 
with Xi G Q that is K-compressible and not 1-compressible, is uniquely K- 
compressible? It is worth emphasizing that the result is not true for K = 1. 
Since the inverse of a linear function is linear, any set S compressible by a linear 
function f(x) = ax + b is also compressible by its inverse g(x) — f~^(x) — 
with g producing the elements in the reverse order as given by /. The question 
is definitely interesting for K > 2 as it may provide insight on the uniqueness 
of the orbits of polynomial functions. 

Our hunch is that, the aforementioned claim, namely the existence of the 
bound M{K) for all K > 2, is indeed true. The verity of our claim has farfetched 
implications on the K-compressible problem being NP-complete. At this stage, 
it is worth recalling the definition of the complexity class UP which stands for 
"Unambiguous Non-deterministic Polynomial-time". The definition of this class 
is as follows: A language L belongs to UP if there exists a two input polynomial 
time algorithm A and a constant a such that 

• if x in L , then there exists a unique certificate y with \y\ = 0(|x| Q ) such 
that A(x, y) = 1. 

• if x isn't in L, there is no certificate y with \y\ — 0(\x\ a ) such that 
A(x,y) = l. 

The algorithm A verifies L in polynomial time. The crucial aspect of the class 
UP is the uniqueness of the certificate y if one exists. If the bound M(K) 
exists, then for all sets S with cardinality N > M(K), either the set is not In- 
compressible or it is uniquely K-compressible. Then the K-compressible problem 
with N > M{K) may actually belong to class UP. The question of whether 
UP = NP, is still an open problem in the theoretical computer science com- 
munity. 

For the trivial case where K = 1, we would like to prove the following result. 

Theorem 4.2. Any 1-compressible set S, with \S\ > 5, is uniquely compressible 
modulo the function inverse. 

Proof. Let S — X\,X2, ■ ■ ■ 7 xn be the given set and without loss of generality 
we can assume that x; t < Xi+\. Then the only possible ways in which the set S 
can be produced by iterating a linear function f = ax + b are 

(i) xi,X2, - ■ ■ ,xn and its reverse order xn, xn-i, ••■ ,x\, when a > 
or 

(ii) xi, Xn, X2, Xn-i, • ■ • and its reverse order • • • , Xn-i, X2, Xn, x±, when a < 
or 

(iii) Xn,Xi,Xn-i,X2, ■ ■ ■ and its reverse order • • • , X2, Xn-\, x\, Xn, when a < 0. 

For any set produced in the order (i), it is easy to see that ^^~ x% — a (by 
the definition of the slope of a line), i.e, the difference between the consecutive 
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elements either increases (for a > 1) or decreases (for a < 1) or remains con- 
stant (for a = 1). But for any set produced by in the order (ii) or (hi), the 
difference between the consecutive elements intially decreases and then steadily 
increases, specifically we have, X3 — a; 2 < X2 — x\ and xn-i — Xn-2 < xn — x^-i- 
This can be seen by defining h = f 2 = a 2 x + c (the composition of / with it- 
self), for some constant c. Then we have, h(xi) — X2,h(x 2 ) — £3, • • • , h(xN) = 
xn-i, Ji(xn-i) — The series x\, X2, •• ■ and the series xn, xn-i, ■■ ■ con- 

verges to the common point xq which is the only fixed point of h, i.e, h(xo) = xq. 
Hence the difference between the consecutive elements of S (represented in the 
ascending order) will steadily decrease for a while and then steadily increase. 
Hence it follows that for sufficiently large N, any set produced in the order (i) 
cannot be produced in the order (ii) or (iii) and vice versa. 

Now, we just need to show that for any set produced in the order (ii), 
it cannot be produced in the order (iii). To this end, let fi(x) — a\x + b\, 
produce the set S in the order (ii), namely x\,Xn, X2,Xn-i, ••• , with fi(x\) = 
xn, /i(xjv) = X2, ■ ■ ■ ■ Similarly, let f2{x) = a 2 x + &2 produce the same set 
S in the order (iii), namely Xn, xi, Xn-i, x 2 , ■ ■ ■ , with /2(xjv) = Xi,f 2 (xi) = 
Xn-i, ■ ■ ■ . Defining h(x) = f 2 o fi(x) = a 2 a\x + a 2 b\ + b 2 , we have h{x\) = 
X\,h{x2) = X2,--- ,K{xn) = Xn- Then h(x) = x,\/x, as it is the only linear 
function which can have more than one fixed point. Then f 2 = f\ ■ This 
results in a contradiction as /f 1 ^) = xn but f2(x 2 ) = xn-2 by definition. 

It is easy to check that for N > 5, the aforementioned arguments holds. □ 

It is an interesting open question to prove or disprove the existence of the 
such bound, M(K), for K > 2. We strongly believe M(K) does exist and fur- 
thermore M(K) — Q(K), i.e, varies linearly with K. We also observed in our 
experiements that, given a set S of size N, the number of possible permutations 
by which it is compressible using a K th degree polynomial drops down expo- 
nentially from N\ for K = N — 2, to one when K = 0(N). Such exponential 
drop is uncharacteristic of NP-complctc problems. 

4-2. Experimental results 

We now corroborate our claim with the following experimental results. Firstly, 
we observed that the set S = {1,2,-- - ,N~ 1, N} is one of the very few sets 
which are compressible for many different permutations of S for a given K. 
Since we did a brute force search over all the TV! possible permutations of S, 
we could conduct the experiment only for small values of N ranging from 6 to 
14. Since a K th degree polynomial subsumes the K — 1 th degree polynomial 
and the linear functions fi(x) = x + 1 and f2(%) = x — 1 can compress S, we 
can stop at the value of K where the number of possible permutations equals 
2. The results adumbrated in the table below unveil that this stopping value of 
K is not far away from N — 2. 
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Tabic 1: JV = 14 



Table 2: JV = 13 



K 


jj= permutations 




K 


# permutations 


12 


14! 




11 


13! 


11 


836644 




10 


394204 


10 


24 




9 


46 


9 


4 




8 


2 


8 


2 




7 


2 



Table 3: JV = 12 



K 


=f/= permutations 


10 


12!=479001600 


9 


23070 


8 


28 


7 


4 


6 


2 



Table 4: JV = 11 



K 


=ff permutations 


9 


11!=39916800 


8 


11798 


7 


32 


6 


2 


5 


2 



Table 5: JV = 10 



K 


# permutations 


8 


10!=3628800 


7 


1726 


6 


12 


5 


4 


4 


2 



Table 6: JV ; 



K 


# permutations 


7 


9!=362880 


6 


398 


5 


8 


4 


2 


3 


2 



Table 7: JV = 8 

K # permutations 
6~ 8!=40320 

5__68 

4 2 



Table 8: JV = 7 

K =ff permutations 
5~ 7!=5040 

j^_66 

3 2 



Table 9: JV = 6 

K # permutations 
4~ 6!=720 
3~ 8 
2 2 



From the table, it is clear that the number of permutations by which the 
given set S (compressing of integers from 1 to N) is compressible, using a K th 
degree polynomial, drops down exponentially faster from JV! for K = N — 2 to 
just 2 when K < N - 6. 

Next, for small values of K (K = 2, 3) , we did a brute force search over all the 
possible sets of a given cardinality N, consisting of integers from a broad range 
[— R, R], to verify whether the bound M(K) exist. Since we restrict ourselves 
to sets S over the rationals Q (only for which the notion of complexity is well 
defined), from theorem (12.11) it follows that, considering sets over Q is equivalent 
to considering sets over the integers Z. Without loss of generality, we can also 
assume that G S (as the elements of the set can be moved so that one of the 
element is 0) and it is the starting element from which the other numbers of the 
set are produced by iterating a polynomial function. We now illustrate in detail 
for the case K = 2. The test for K — 3 was done in the similar fashion. 

We considered all possible triplets of numbers x\ 1 X2,x^ satisfying the fol- 
lowing set of constraints, namely 

• Xi ^ xj if i ^ j. 

• x z ^0, i E {1,2,3}. 
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• -R < Xi < R,i 6 {1,2,3}. 

• The sequence 0, x\, X2, X3 cannot be produced by iterating a linear func- 
tion. 

We set R — 10, 000. We then constructed a quadratic function / = ax 2 + bx + c 
satisfying /(0) = x\, f(x\) = x^ and f{xn) = x%. Upon iterating /, we obtained 
the remaining numbers of the set S. Since the numbers of S can be as large as 
10 40 , we used the GMP package-acronymed for GNU Multiple Precision library- 
which provides arbitrary precision arithmetic. When the size of S equals 6,7 or 
8, we found no other permutation of its numbers for which the generated set 
can be produced by iterating a different quadratic function. Hence we strongly 
believe that the bound K = 2 is M(K) = 6. Similarly for K — 3, we conjecture 
that M{K) = 8. 

The experiments seems to suggest the the bound M(K) may exist even for 
other values of K. If it does exists, then the K-compressibility problem will 
be in the complexity class UP whenever N > M(K). Hence we believe that, 
unless UP — NP, it may be hard to show that the K-compressibility problem 
is infact NP-hard. 

5. e K-Compressibility of sets 

The previous sections dealt with lossless compression where the elements of 
the given set S are reproduced exactly by iterating a polynomial / of a relatively 
smaller degree K. Exact representation of a set is typically a non-requirement 
and most real world scenarios do not impose such a hard constraint. In this 
section, we formally define the notion of e K-comprcssibility and then provide the 
necessary condition that a given set S should satisfy for it to be K-compressible 
while allowing for small perturbation of its elements up to an e distance. 

We start with defining the notion of e-perturbation. 

Definition Given e > 0, y is an e-perturbation of x if \y — x\ < e|x|. 

Definition A set S — {xi, x% , . . . , xjy } consisting of TV numbers is e K-compressible 
if there exist a K-compressible set S = {xi, X2, ■ ■ • > ^jv} where Xi is the e- 
perturbation of Xi. 

Intuitively, it is clear that larger the value of e, better is the compression ratio. 
But given an e and K < N — 2, how to determine whether the set is e K- 
compressible? Closely following the lines discussed under section (|3]), we now 
provide an approach to tackle this problem. 

As the elements of the given set S can be shifted and the given e suitably 
adjusted, we can restrict ourselves to the case where the elements of the set S 
are assumed to be positive, i.e, Xi > 0,Vi. Let ii denote the e-perturbation of 
Xi. Pick a y £ S and assume its e-perturbation to be the last element in the 
orbit of a K th degree polynomial / starting from xq, i.e, f N ~ 1 (xo) = ijv, where 
Xq is an e-perturbation of some element xq £ S. Without loss of generality let 
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xn = y- Let Xi < 1 < i < N — 2. Assume e to be small enough so that 

(1 + e)xi < (1 — e)xi+x ,i.e, Xi < ii+i. The problem then amounts to finding 
the x^s such that the following over-determined Vandermonde system 



Xi 

■i-2 



7.2 



1 Xpj-l X 2 



N-l 



1 



'N-l. 





a 




Vl 




ai 








_a K _ 







has a solution which also satisfies the Hamiltonian path constraint. Each Vi is 
an e perturbation of some Xi G S and equal to one of the x'jS. vpj is the starting 
element xq. Recall that in section ^ we proceeded forward by considering 
the QR decomposition of the above Vandermonde matrix. Since x^s are by 
themselves unknown and are e perturbations of Xi, bounding the coefficients of 
the matrices Q and R as a function of e seems to be hard problem by itself. 
Hence we consider a slightly different formulation which involves computing 
determinants of Vandermonde matrices for which closed form solutions exist 
and hence the coefficients of the resultant matrix can easily be bounded. 

Finding the x^s for which the above Vandermonde system has a solution is 
equivalent to enforcing that the determinant of every K + 2 x K + 2 sub-matrix 
of the following matrix 



xi 
x 2 



x x 

~K 



Vl 

v 2 



1 ijv- 



U N-1 



%N-1 Vff-i 



is zero. Keeping the first K + 1 rows fixed and changing the K + 2 row and 
expanding w.r.t to the last column, we obtain the following N — K — 2 linear 
equations on v±, V2, ■ ■ ■ , Vjv-i namely, 



02,1^1 + a2,2^2 



CV K +2 ' 
0v K+2 



Ov K +3 
CV K +3 



Ov N -i 

Ov N -i 



0,N-K-%lVl + a N - K -2,2 v 2 H 1" OUK+2 H 1" CVN-i 







(6) 



where 5jj and c are the determinants of the following Vandermonde matrices 
respectively. 



(-1) 



K+j 



Xl 



x j+1 



xk+i 

%K+1+; 



x j+i 



T 2 

X K+1 
x K+l+i 



i-i 

~K 
X j+1 



X K+1 
x K+l+i 



16 



c = 



Xl 
X2 



i 

x 2 



x 2 



~r,K 



XK+1 ■ ■ ■ ^K+l 

For succinct representation, we can add an additional insignificant term 0*vjy to 
the left hand side of each these equations. The equations can then be represented 
in the matrix form A * v = where A is the N — K — 2 x N matrix containing 
the coefficients (with the last column of all O's) and v = [vi, v 2 , ■ ■ ■ , vn] t e 
Q N representing the variables. If we denote x = [x\,X2, ■ ■ ■ ,xn] t G Q^, the 
problem then reduces to finding a permutation P satisfying the Hamiltonian 
path constraint such that 

I*P*x = 0. (7) 

Define & N — K — 2 x N matrix A in the exact way we defined A above except 
replacing the x^s in the definition with the corresponding x[s. Notice that A 
can be pre-computed using which we can bound the entries of A as follows. 

To start with consider the determinant oi&nK+lxK + l Vandermonde 
matrix V which can be written in closed form as 

d(x) = d(x\,X2, ■ ■ ■ ,xk) = det{V) = J~] (xj - Xi). 

l<i<j<K 

Let rji = e\xi\. For small e, we can use the second order Taylor expansion to 
approximate 



(8) 



where the first partials are given by 



dd 

dxi 



d{x) fx: -4-] 

\ , _, . Xi Xfc I 



and the entries of the Hessian matrix |gf are given by 



d 2 d 

dxidxj 



(x,— x k )(xj— xi) J 2—ii=£.k [xi—x k 



if i ^ j 



if i = j 



Here £ = £2, • ■ • , £k] T where each component ^ is bounded by & < Let 
4m and d m i„ respectively denote the maxima and minima of subject to 

the condition that & < 77,. Without loss of generality we can assume Xi < x i+ i 
and for small e, Xj + 77, < Xj+i — jjj+i and hence both <i(x), d(x + £) > 0. Let 
rfmm = d(x)(l — 5i) and d max = d(x)(l + 62) where Si,S 2 > 0. Since there 
is no guarantee for the Hessian matrix to be positive semi-definite, finding the 
maxima and the minima and hence the bounding coefficients <5i and S2 , of the 
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quadratic program in equation © may be a NP-hard problem Neverthe- 
less, quadratic programming is well-studied field and the literature is inundated 
with algorithms which can efficiently compute solutions for a quadratic mini- 



mization/maximization problems [12|, |13|, [14|, [15[ . As we are only interested in 



bouding the value of the determinant, it is not necessary to exactly compute 
the maxima and minima of the quadratic program and a good approximation 
for the same will suffice. 

The above set up can now be employed to bound the entries of A using 
the entries of A. Recall that each entry of both the matrices are either zero 
or obtained as the determinant of a particular Vandermonde matrix. Running 
the quadratic program (equation©) for each non-zero entry of A, we can easily 
bound each a i: j~(i, j) th entry of ^4-by the corresponding (i,j) th entry of A, i.e, 

(1 - 7i,i,j)\<H,j\ < < (1 +72,i,i)K,j| (9) 



where 72, ij > 0. Since the entries of both A and A can be signed numbers, 
we can choose a large enough positive constant D using the above bounds and 
shift the entries of both the matrices by D so that they are strictly greater than 
zero. Denote the shifted A and A by matrices B and B respectively and its 
corresponding shifted (i,j) th entry by bij and bij. Using the bounds for a,ij 
(equation ©) we can bound each bij by 



(1 - 8i,i,j)bij < k 3 < (1 + h,i,j)Ko (10) 



where 



and 



7i,i,j"i,j 



if a,- > 



hi h3 

KiJ = { (n) 

i£a itj <0 



if aij > 



72,i,ja.,j 



-71,i,j"i 



if dij < 



Now if we use the shifted matrix B instead of A, the problem of finding the 
permutation P that solves equation© is tantamount to finding the permutation 
P such that 

B * P * x = a 

where each component dj of a equals &i = D * (j2f = i bij^j ■ Again we pick 

a large enough C max similar to the one under section (|3.1[) and define h u and 
a according to equations ([3|) and (j4]) respectively, where we replace b^ u and 
a.i with bi yU and on. Employing the series of transformation discussed under 
section©, the problem again reduces to finding the permutation ir G Sn such 
that 

JV 

^ K( u )Xtt(u+1) — " (13) 
u=l 
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where ir(N + 1) refers to 7r(l). Define a, = D * ^X^Li where &ij denote 

the (i,j) th entry of the matrix S. Define /i u and a according to equations |3]) 
and (j4|) and consider the problem of finding ir G S'jv such that 

TV 

^ ^7r(«)^7r(«+l) = 
u=l 

The bounds for bij using bij (eauation tjlOp ) can be used to bound h u in terms 
of h u as 

{1-P\,u)K <h u < (l + fc,u)K 

where 

Pk.u = ; 

h u 

for k £ {1,2} and 61, ij and 62,1, j are given by equations (fTTI) and (|T2"j) respec- 
tively. It is worth mentioning that /3k. u > and since h u and h u are both 
positive, j3i, u < 1. Let ft imax = max u {/3fc jU } for fc G {1,2}. Then it is easy to 
see that (1 — /3i, max )h u < h u < (1 + (32,max)h u . Also recollect that i a is the 
e-perturbation of x u and hence by definition (1 — e)x u < x u < (1 + e)x u . Using 
the bounds for h u and x u , for any permutation tt S Sjv, we can bound the sum 
J2 u =i ^7r(u)^7r( J1 +i) by 

N N N 

9l ^ h v (u) X Tv(u+l) < ^ ^7r(«)i7r( M +l) < 92 ^ ^7r(u)^7r(u+l) (14) 
u—1 u—1 u—1 

where 91 = (1 - /3i, max )(l - e) and 92 = (1 + /32,max)(l + e )- Similarly each 
can be bounded using a>i as (1 — 0i,j)aj < dj < (1 + 62,1)01% where 

fc G {1, 2} using which a can be bounded in terms of a as 

(1 -Pi)a < a < (1 +p 2 )a (15) 

where 

E?-i K ~ 2 O k ,iai(NC max + ly- 1 

Pk = — — ■ 

a 

From the equations (TUT) and (fT5|) . it is clear that for the permutation 7r that 
satisfies (fT3)) , 

1 w 1 , 

1 — Pi , , . 1 +P2 ,..„,. 

a < >. «7r(u)^(«+i) < a, (16) 

9i ^ 1 92 

which gives us the necessary condition for the given set S to be e K-compressible 
with the e-perturbation of Xn assumed as the last element in the orbit of a K th 
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degree polynomial. It is worth emphasizing that all the constants in the above 
equation (fTB]) can be directly computed from the given set S. Repeating this 
process for the N different choices of Xn gives us N different conditions similar in 
form to equation (TT5|) with the constants taking on different values for different 
constraints. Then the necessary condition for the set to be e K-compressible is 
the existence of a permutation it that satisfies at least one of these N constraints. 

Given a set Z = I [ 1 ) , [ 2 ],..., [ ^ N J I of N tuples and constants C\ 



K X\) \X2j \ X Nj 

and C2, the problem of finding the permutation 7r such that 

JV 



11=1 



can be transformed to the multi-criteria traveling salesman problem as discussed 
under section p.2[) and the solutions given in [8, 9, 10] can be adopted to solve 
our problem. 



6. Conclusion and open problems 

We introduced a new notion of compressibility of sets of numbers where the 
set is represented through the coefficients of smaller K th degree polynomial / 
and is produced by repeated composition of / with itself. How effective the 
compression is depends upon the inherent structure the numbers constituting 
the set exhibits. Smaller the degree of the polynomial (K) w.r.t the size of the set 
(N), higher the compression ratio. We then provided approaches to determine 
whether a given set is K-compressible by transforming it to the multi-criteria 
traveling salesman problem (TSP). The solutions developed for the multi-criteria 
TSP can then be availed to obtain solutions for our problem. Though we didn't 
formally prove it, we showed why it is unlikely for the K-compressibility problem 
to have a polynomial time solution. We then discussed about the notion of 
e K-compressibility for the case of lossy compression, where we provided the 
necessary condition that the given set should satisfy for it to be e K-compressible, 
which can be tested using the algorithms developed for multi-criteria TSP. 

A random set of N elements may not be K-compressible for all values of 
K < N — 2. Hence trying to reproduce the elements of a set by successive 
composition of smaller degree polynomial function might have limited practical 
relevance in its current form. Nevertheless, our work on the current problem 
does provide headway to solve many generalizations of it which might have 
widespread applicability. These open problems are listed in the subsequent 
section and we would like to address them in our future work. We also pose 
some open problems in dynamical system theory which are interesting from a 
core theoretical perspective. 

6.1. List of open problems 

1. Is K-compressibility NP-hard? 
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2. Given a set S of N numbers and integers m and K, can S be partitioned 
into m subsets such that each subset is K-compressible with or without 
the same compressing function? The current work provides solution for 
the case m = 1. 

3. Can K-compressibility be extended for other class of functions like piece- 
wise linear functions or even splines? Does this class of function exhibit 
similarity to the problem defined above? Is "partitioning" the function 
equivalent to partitioning the set? 

4. Given a set S of N numbers and integers m and K, can S be made In- 
compressible by addition of utmost m numbers to it? 

5. For every K > 2, does 3 integer M(K), such that VW > M(K), any 
K-compressible set S consisting of N distinct numbers is uniquely In- 
compressible, i.e there exist utmost one compressing function / 6 J| K , f ^ 

11:' 

6. For every K, does 3 an integer U(K), such that for any set S with cardi- 
nality N > U{K), the number of bijective mappings from S to S defined 
using / G Y[k ^ s utmost polynomial in N and K? If this is true, then 
enforcing the Hamiltonian path constraint-defined in section (j3]) -may be 
unnecessary if N > U(K). 
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